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ABSTRACT 



In an example embodiment, a method of delivering a com- 
mand from an initiator device also transfers data identified 
by the command to a target device. The data is transferred 
between the initiator device and the target device according 
to a selected maximum pay load size. The method includes 
determining whether or not the size of the data associated 
with the command is greater than the selected maximum 
payload size. If the size of the data associated with the 
command is not greater than the selected maximum payload 
size, then a block is transferred to or from the target device 
which includes the command and all of the data associated 
with the command. If the size of the data associated with the 
command is greater than the selected maximum payload 
size, then a block is transferred to or from the target device 
which includes the command, an amount of data associated 
with the command equal to the selected maximum payload 
size and an indication that not all of the data associated with 
the command was included in the transferred block. 

31 Claims, 5 Drawing Sheets 
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EFFICIENT COMMAND DELIVERY AND to be written within a block A within the system memory 

DATA TRANSFER SM. TTie processor P transfers a command to the register on 

the PCI lyO adapter card A via a path over the system bus, 
PCI bus bridge, and PCI bus. This tells the I/O adapter card 

BACKGROUND 5 ^ ihat a new command has been issued. I/O adapter card A 

1 Field of the Invention decipher that command and then read system memory 

THIS invention relates generaUy to methods and apparatus to obtain the address of the write command. It must also 

for transferring commands and associated data blocks. In ^^^^ a pomter, which is the va^ue represenUng an address 

particular, the presem invention relates to methods and within the systern memoir SM where the data ^aated 

apparatus for efficiently transferring commands and their ^° with the command can be found^ (The pomter may be virtual 

associated data between various devices in a network or in Physical and the location of the data is not necessarUy 

a server architecture. conuguous with the location of the command. Indeed, the 

^ r\ • cii-ni.jA_. data may be split, requiring a Scatter/Gather List (SGL) to 

2. Descnption of the Related Art describe the locatioi^ of the data.) THe I/O adapter card A 

The latency incurred when transferring data can greaUy ^^en goes to the address of system memory SM pointed to 

diminish the perfora:iance of netwoiics and server architec- ^ the pointer. The block of data A is read from the system 

tures since the transferring and the transferee input/output 3^ ^^^^^ ^^^^ ^ 

(I/O) devices are usuaUy unable to engage in other opera- ^^^^^^^ ^^^^ f^^^^^es. "Hie data is then subsequently written 

tions until the data transfer is complete. This latency is ^jie I/O adaptor card A to the hard disk HD. Even if the 

longer and even more compUcated in networks and server p^^c^ssor sets aside known areas for the commands in 

architectures than m other computer systems because there 5^^^, memory SM so that the I/O adaptor card A always 

is so much competition for network and server resources ^^^^^^ j^e command, the I/O adaptor canl 

mcluding system memory, processor<s) and multiple I/O ^^^^j j^e write command to know where 

devices. This can be quite disadvantageous in networks and ^^^^ ^^^^^^ fo^^ ^^e fetches to obtain the 

server architectures where a large number of data blocks are ^j^^^ 

frequently transferred between the processor, memory and * • -i j i_ n j 

1 ^-fl- . i/in. J • j; .u ^ * ui 1 f A similar procedure occurs when the processor P reads a 

several different I/O devices and/or the data blocks are of , , , ^ * r . . 1 1- 1 ,^t^ ■ . j 

■J 1 J ir . • I J J .u 1 t f «: • • blodc of data from the hard disk HD, I.e., the adapter card 

widely different sizes. Indeed, the lack of efBciency in ^ . , , 1 i- . -i- /i , . 

r • J * ui 1 u 1 «• . 11 A would store the block of data withm a block B within the 

trans ferrme data blocks may have a larger effect on overall . • i- • . 

- .1. .i_ J .i_ r u system memory SM, then pass an mdication to the processor 

performance than the speed or other performance charac- ^ ^ 1 . j / •_ . . j- t tit^ c . 

f . r 1 . • .1 u-» -^u p that the read process from the hard disk HU has been 

teristics of the elements in the network or server architec- ^-.j. .Luin 

1 u *u . .u u -J/ J . J nmshed, whereupon the processor P can access the block a 

ture. It also may be that the buses and/or I/O adaptor cards - , . I , . ■ . , t 

f/i^ -1 • . .u .u u 4*1 1 within the system memory SM to obtain the data. Such a 

connecting I/O devices to the processor are the bottleneck . ; , . . „ „ - 

and the j^rformance of these I/O subsystem t»mponeats '^MjeaUonal procedure (Ulustrated generaUy in FIG. 3) of 

J . ^ ■ J sending a command with pomter (step 1), waitmg tor and 

needs to be improved. „ . * r j . * S\ j u »i 

^ . , . „ . ,.11 receiving a request for data (step 2) and subsequently 

Conventional servers typically have multiple adaptor ^^^^ ,^ ^^ ^^^^ ^ 3) ^as 

cards, each of which usually supports multiple ^0 devices. substantial inherent latencies and delays. The procedure is 

A server may have a significam number of I/O devices inefficient and slows down the entire system since 

configured m a load/store configuration such as shown in processor cycles wiU pass before the data transfer is 

FIG. 1. Even though the processor may perform optimally, 40 comoleted 
the performance of the server is still less than optimum 

because the I/O devices in the server may be of radically SUMMARY 
different types, store different kinds of data and/or vary from 

each other in the addressing sequence by which the data The present invention is directed to the delivery of 
blocks containing the data are written and read out. For 45 commands and transfer of data associated with the com- 
example, a pre-recorded CD-ROM may store large contigu- mands. A method of delivering a command from an initiator 
ousblocksof image data and the read out of such image data device also transfers data identified by the command to a 
by an optical disk drive may consist of several smaller target device. The data is transferred between the initiator 
sequential reads. Another I/O device may store heavily device and the target device according to a selected maxi- 
fragmented user data and the data readout from such a 50 ™tim payload size. The method includes determining 
device rarely consists of large blocks of data. whether or not the size of the data associated with the 
More particularly, in the example of FIG. 1, there is command is greater than the selected maximum payload 
shown a processor P, system memory SM, I/O adaptor card size. If the size of the data associated with the command is 
A, hard disk HD, I/O adaptor card B, a network interface not greater than the selected maximum payload size, then a 
card NIC, I/O adaptor caitlC and a CD-ROM drive CD, all 55 ^ transferred to or from the target device which 
connected along an input/output bus, for example, a Periph- mcludes the command and aU of the data associated with the 
eral Component Interconnect (?Cl) synchronous bus as conunand. If the size of the data associated with the corn- 
described in the latest version of "PCI Local Bus "and is greater than the selected maximum payload size, 
Specification, Revision 2.1" set forth by the PCI Special then a block is transferred to or from the target device which 
Interest Group (SIG) on Jun. 1, 1995. The PCI architecture 60 "^eludes the command, an amount of data associated with 
provides the most common method currently used to extend the command equal to the selected maximum payload size 
computer systems for add-on arrangements (e.g., expansion and an indication that not aU of the data associated with the 
cards) with new disk memory storage capabilities. command was included in the transferred block. 

In this load/Store configuration, taking a write command ^^^^ DESCRIPTION OF THE DRAWINGS 

for example, suppose the processor P wishes to write a block 65 

of data within the hard disk HD. First, as shown in FIG. 2, The foregoing and a better understanding of the present 

the processor P stores the command and its associated data invention will become apparent from the following detailed 
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description of example embodiments and the claims when data block in system memory and to optimize the coordi- 

read in connection with the accompanying drawings, all nation of the transfer of I/O data blocks. For a disk drive, the 

forming a part of the disclosure of the invention. While the data block is the unit of addressing and data transfer If the 

foregoing and following written and illustrated disclosure value of one byte is to be updated on a disk, then the data 

focuses on disclosing example embodiments of the 5 transfer would include a block of data (512 bytes, for 

invention, it should be clearly understood that the same is by example) that contains the byte of interest The byte value in 

way of fllustraiion and example only and is not to be taken ^ « ^OPY of ibe block held m memory would be updated, and 

bv way of limitation ^^^^ ^^^^ ^ transferred from memory to the 

^ ^ . , t J- .J- L • drive, overwriting the old block stored on the disk. However, 

HG. 1 is a gcnerahzed block diagram providmg a basic ^^^^^ according to the example embodiment is not 

aiustralion of a server architecture havmg a conventional 10 limited in its application to disk drives or storage devices. In 

load/store configuration. particular, the method according to the example embodiment 

FIG. 2 is a somewhat more detailed diagram of a server may be useful for transferring data among computers and 

architecture showing the conventional transfer of data other devices on a network since data latency is critical in 

blocks A and B from a CPU to a hard disk. such environments. The data may be transferred in blocks of 

FIG. 3 is a chart illustrating the steps of a conventional different sizes depending upon, for example, the target 

write operation in which an 1/0 data block is transferred device, the transfer protoco l (such as, for example, Ethern et 

from an initiator to a target device on a network. packets ), etc. 

FIG. 4 is a diagram illustrating a switched fabric configu- One example application of the invention is in a processor 

ration in which the command delivery and data transfer 20 chipset incorporated in the input/output control circuit of 

method according to an example embodiment of this inven- a sfitvex idcvice to operate in conjunction with a processor, 

lion may be advantageously performed. such as the Intel Pentium II Xeon'^" or Pentium in Xeon™ 

RG. 5 is a Chan generally iUustrating a conventionaUy processor. However, such an embodiment is but one possible 

implemented solution intended to address the delays in data example of the invention which may. of course, be apphed 

transfers 25 ^ computer having a processor and an input/output 

n^^. 1 !»• ♦jj* device and indeed in any server architecture where an 

FIG. 6 shows another conventional solution mtended to . . . . . j j- j » ui 1 . p 

, , , , . , - improvement m wntmg and readme data blocks to or from 

address delays m data transfers. ^^^.^ ^ ^^^.^^^ ^^^^^^^^ ^^^^ 

^u^. ^ ^Tv '° ' h'"'^ H One possible appUcation of the invention is in a server 

method embodiment to efficiently dehver commands and .^chiteciure with the switched fabric configuration shown in 

transfer data. ^ ^^^^ particularly, the configuration is preferably of 

HGS. 8 and 9 show examples of the distribution of data ^ Next Generation l/Oj CHS!0) architecture aUowing remote 

transfer sizes and the selection of maximum payload sizes dSect memory access anOie server employs message 

according to one possible embodiment of the mvention. passing in com pUanoe wittU hc-NGIOstandard, \fersion 1.0, 

FIGS. 10 and 11 are graphical illustrations of command published Jul. 22, 1999. (This application of the method and 

delivery and data transfer depending upon whether the size apparatus of the invention is exemplary only. The method 

of the data transfer is greater than the selected maximum and apparatus of the invention arc also applicable to non- 

payload size in an example embodiment of the invention. switched point-to-point connection links in other configu- 

HGS. 12-14 illustrate different possible channel imple- rations or networks.) Such a server architecture might 

mentations which can alternatively be used with the example 40 include, as shown in FIG. 4, a host having at least a 

embodiment of the invention. processor P such as an Intel Pentium II Xeon""*" or Pentium 

III XEON™ processor, system memory .SM, and an input/ 

DETAILED DESCRIPTION ^^jp^j yQ containing a channel adaptor CA. These 

An example embodiment of the present invention seeks to devices of the host may be all connected along a single bus 

decrease the inefficiencies of the transfer of data to input/ 45 as shown, or by multiple buses, or may be connected by 

output devices in a server architecture, such as what occiu^ point-to-point connections. The external data transfers are 

when an I/O data block is transferred to or from a mass optimized through the I/O unit at all times. This helps avoid 

storage device such as a hard disk. In particular, PQ the processor or other elements from having to expend 

compliant I/O adapters cannot accomplish dau transfers system resources to accomplish transfer of I/O data blocks 

without the multiple steps discussed above. Computer sys- 50 since there may be access conflicts with other functions. Of 

lems generally have a processor, associated system memory, course, a server can operate as a host when initiating a data 

an input/output (I/O) device, and at least one bus, such as a transfer or it can operate as a target device when receiving 

PCI bus, connecting these components. A server is a type of a data transfer initiated by another device, 

computer system having an architecture or otherwise The channel adapter CA of the I/O unit, in turn, is 

designed to be able to support multiple 1/0 devices and to 55 connected to a switching fabric SF, which may contain many 

transfer data with other computer systems at high speed. different switches SW and redundant paths throughout the 

(Due to recent advances in the performance and flexibility of fabric, such that a plurality of messages can be traveling 

computer systems, many modem computers are servers through the switching fabric at any given time. Accordingly, 

under this definition.) Although many servers currently when the processor P issues a write command, for example, 

utilize PCI buses, the example embodiment of the invention 60 ibe processor P now simply passes the same to the channel 

set forths a data transfe r where the transferee device has adaptor CA which injects it into the switched fabric SF, such 

remote dirccF m emory' access ( RDMA) to virtual addresses^ that the processor P does not have to wait for processing of 

th us enablmg protected, tfli'ge l-mana ged data transfer. the and locking of the system bus, but instead go on to 

The example embodiment attempts to reduce the latency perform other processing operation until the processing is 

when an element of the host server, such as one of the 65 completed. 

processors, attempts to write a data block to the hard disk According to the present invention, the channel is any 

drive either for the execution of instructions or to store the means of transferring data, including but not limited to 
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virtual channels, used to transfer data between two end- transfer) for the left-over data as indicated by (2) in FIG. 7. 

points. While the example embodiment is an NGIQ imple- The flag can be included within the header of the command, 

mentatioa and thus supports the channel dcdnition provided and can be any type of flag, e.g., simply a single bit 

in the specification identified above, the present invention is indicating that there is left-over data or can be more complex 

not so limited. In accordance with the implantation in the s such as an address of the left-over data. In any case, the 

NGIO specification, once injected into the switched fabric request for left-over data is immediately issued from the 

SF, the write command travels through the switches and largetdevice without waiting to process the other parts of the 

eventually arrives at a second channel adapter CA where it command or the data in the small data block. After the 

can be given to an 1/0 adaptor card A where it is subse- request is forwarded to the initiator, the target device can 

quenlly written to the hard disk HD or to a network interface jq start to process the command together with the small data 

where it is subsequently transferred to another computer block (3). The left-over data is transmitted aaoss the 

device on a connected network (not shown). Accordingly, switched fabric (4), again without waiting for the processing 

the inherent delays in deciphering the command and writing ^f the command and the smaU data block in step 3. However, 

of the data as required by the I/O adaptor card A are not ^ P°i°^ ^^^^V' ^^f^' r"^"*"^.*" u^'ii'P' 

experienced by the processor P which is on the other side of . , f^^^*^ expectation of the left-over data, preferably before 

the switching fabric, and can continue processing. As shown f ^ ^^^^"^ ^""^T t k k T P^^^'^* ,^^8^"^" 

. . ' c *• . ' less of whether the target has finished processmg the com- 

m FIG. 4. the message passing configuration can conlam a ^^^^ ^^^^^ ^^^^ ^ ^^^^ ^^^^^^^ ^^^^ 

plurality of channel adapters CA and host servers connected in,„3ediately grabbed. If the command and small data block 

to the switchmg fabric SF, such that there can be a multitude ^^e stiU being processed, the left-over data is stored in the 

ofdiflferent messages traveling through the fabric and where ^^^^ ^nd processed immediately after the command and 

aU of the hosts can continue operating while their messages small data block. If the command and small data block have 

are traveling through the switching fabric. already been processed, the left-over data is immediately 

Turning now to FIGS. 5-7, illustrating and describing a processed (5). Accordingly, a substantial portion of the 
plurality of examples for understanding the invention. One previous inherent delays are avoided, thus achieving the 
possible solution is shown in FIG. 5. More particularly, the 25 advantages of the present invention, 
data is limited to some predetermined size, e.g., a 2 kilobyte of course, the parameter of primary importance is the 
size. By limiting the data block size, the constraints on the selection of the amount of data, called the maximum payload 
target buffer are released since it is known which maximum, size, that can be transferred in one of the transfer blocks. If 
reasonable size of data is always going to be obtained, and the allocation of memory in a computer device was incon- 
thus it is easy to provide a smaller reasonably sized buffer in 30 sequential in terms of cost, power consumption, etc., then of 
the target device. The problem with such an arrangement is course an extremely large of memory could be provided, 
that there is a limitation in block transmission size, thus since that is not the case, one important point of the present 
requiring large blocks of data to be sliced into smaller invention is that an advantageous memory size/latency 
blocks, with a header identifier attached to each of the tradeoff is made by proper selection of the maximum pay- 
smaller blocks. Such smaller blocks are repeatedly sent until 35 load size for a target device and data transfer. In the two 
all of the data has been transferred. This solution increases transfer size graphs shown in FIGS. 8 and 9, the left hump 
the number of data transfers, the amount of data being or peak in the graphs display situations in which the data 
transmitted across the network, is fixed to the size of the block to be transfered is relatively smaU in size and the 
buffer, and also represents significant record-keeping prob- commands are sent with a relatively small amount of data, 
lems. 40 whereas the right-hand peak represents situations where the 

FIG. 6 illustrates the solution of how, in order to avoid the data block to be transfered is relatively large in size and the 

ping pong back and forth shown in FIG. 3, it would be nice command is sent with a large amount of data. Such a data 

to be able to transfer the command and all of the data at once. transfer is problematic as illustrated in FIG. 6. Accordingly, 

Such is passible with small data blocks. However, a problem in order to improve the eflBciency of the present invention, 

is encountered where the data block is excessively large. 45 the host device selects the maximum payload size B of the 

More particularly, if the data block, for example, is 1 or 2 data block which can be transmitted together with the 

megabytes, a problem exists in that a 1 or 2 megabytes buffer command. The block size can be selected in any manner, 

memory size would then have to be provided within the However, it may optimally be based on: 1) the distribution 

target device. Such an excessive memory requirement is not of the sizes of the data blocks transferred to and from the 

cost effective. Accordingly, transfer of the command and all 50 target device; 2) the typical number of outstanding com- 

of the data at once may not be feasible in many instances. mands; and/or 3) the typical processing time for a command. 

Accordingly, FIG, 7 shows a large data block which is Preferably, the selected maximum payload size is slightly to 

divided into smaller size data blocks and a left-over data the right of or greater in size than the highest peak repre- 

block. Also shown is a series of operations with the present senting the most typical sized data block. As shown in FIGS, 

invention. More specifically, first as indicated by (1), a 55 8 and 9, the selected maximum payload size B in FIG. 8 

command plus a certain amount of data is transmitted across would be to the right of the right-hand peak A^ since that is 

the switched fabric to the target. Accordingly, at least a the highest peak and the selected block size in FIG. 9 would 

portion of the data is immediately transferred with the be slightly to the right of the left-hand peak Aj since that it 

command. If the data is smaller than the certain amount, the is the highest peak. The data transfer size distribution may 

entire data block can be transferred with the command, thus 60 be caused by, for example, different target devices or dif- 

resulting in an immediate advantage. However, if the data ferent types of data files. Alternatively, the selected maxi- 

block is bigger than the certain amount which can be mum payload size B could be changed according to the most 

transmitted with the command, the target immediately rec- recent data transfers with any number of different algorithms 

ognizes that additional data will be required (e.g., by refer- and weighting factors, moving averages, etc., being avail- 

encing a flag in the header), whereupon the target can 65 able to determine and perhaps, dynamically change the 

immediately issue a request (or multiple requests which selected maximum payload size B so that it is determined in 

transfer the data in multiple pieces in the case of a very large firmware or software rather than being preset in hardware. 
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RGS. 10 and 11 illustrate the uansferred data blcxks 
coatainiDg the command and data blocks in relationship to 
the selected maximum payload size B. (The transfer block 
may also contain a header which is not shown for purposes 
of clarity.) FIG. 10 is a first example wherein the data to be 
transferred is less than or equal in size to the selected 
maximum payload size B. FIG. 11 is a second example 
wherein the data to be transferred is greater in size than the 
selected maximum payload size B. 

HGS. 12-14 show several channel examples useful in 
understanding aspects of the invention. More particularly, 
turning first to FIG. 12, there is shown a single channel 
which is used to transmit across the channel adapters. More 
particularly, first the command and small data blodc are 
transmitted, and subsequently the left-over data is transmit- 
ted along the same single channel. Such arrangement is 
advantageous in that there is only a single channel, but is 
disadvantageous in that if a huge block of data (e.g., 2 
gigabytes) is to be transmitted, such huge data block trans- 
mission will lie up the single chaimel, thus blocking all other 
messaging. 

Accordingly, a further advantageous arrangement is 
shown in the channel example shown in FIG. 13. More 
specifically, such example includes two channels. First there 
is shown a messaging channel which is used exclusively for 
transmitting the command and small data blocks. Further 
shown is a bulk data channel which can be used to send 
left-over data. Accordingly, if there is a huge data block to 
be transmitted across the fabric, the bulk data channel is 
used to transmit such left-over data, thus not tying up the 
messaging channel which can continue to operate and trans- 
mit messages across the fabric. Dual channels are employed, 
but is advantageous in that bulk data or left-over data 
transmission will never tic up the messaging channel 
Although the invention can be practiced using a single 
channel, it is generally more advantageous to use multiple 
channels. 

In FIG. 14, there is shown a single messaging channel and 
a plurality of data channels 1, 2, 3. With such arrangement, 
left-over data can be sent over a single one of the channels 
1, 2, 3. Such is advantageous in that only one data channel 
will be tied up transmitting any bulk or left-over data. In 
contrast, multiple channels can be used in parallel for faster 
transmission of the bulk or left-over data. More specifically, 
the bulk or left-over data can be cut up into equally sized 
blocks and transmitted over various channels. 

There are different possible points in the server architec- 
ture to implement the delivery method. The first possible 
implementation is at a somewhat centralized (but not shared) 
location. This implementation takes advantage of the fact 
that I/O adaptor cards are a standard component of input/ 
output subsystems and generally don't include any special- 
ized circuitry or software for effectuating the described 
method of transferring data blocks. In the context of this 
application, they can be considered "dumb" I/O cards. An 
example implementation of the invention uses such dumb 
I/O cards because they are standardized and less expensive 
than non-standard I/O cards, and performs the method 
elsewhere in the network configuration. The method may 
support a different block size B for each I/O device. In such 
case, it then looks at the block size for the respective 1/0 
device and the data transfer is carried out per I/O device 
based on the size of the data blocks generally transferred for 
operation of that particular I/O device. Also, for example, 
one I/O device may be a CD-RW disk drive and the other 
may be a high speed communications interface (e.g., an 
asynchronous transfer mode (ATM) interface). Preferably, 
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the method is implemented by firmware or software, 
although it may be accelerated with hardware support. 

A set system length parameter is tuned to the target device 
or some basic unit. Preferably, a basic unit is set for the data 

5 block size B which is equal to a multiple of the data storage 
format of the I/O device. For example, in a disk drive storing 
data in 1 kilobyte sectors, it is preferable that the predeter- 
mined block size be a multiple of kilobytes. However, the 
data blocks may be of different sizes controlled according to 

10 the firmware or software. The firmware or software may or 
may not maintain an internal history of data transfers to each 
1/0 device and adjust the block size B according to the 
history to provide a further level of adaptability to adjust to 
operating conditions. The block size B can thus respond to 

15 conditions of the I/O device at the time of operation, rather 
than a static design made at decision time or boot time. 

Although an example embodiment, the invention is not 
limited to the switched fabric configuration or to a host 
server as illustrated in FIG. 4. Indeed, an advantage of the 
exemplary embodiments of the invention is that they are 
particulariy useful and widely adaptable to any I/O device 
having latency in data transfer operations. In this way, data 
transfers can be efficient in both a server that has a great deal 
of network I/O interfaces and a server that has a great deal 
of I/O disk interfaces. The example embodiments will 
automatically adapt to transfer characteristics in which large 
blocks of data are generally transferred rather than small 
blocks of data. Indeed, the example embodiments will adapt 
to any I/O data interface. There will be no need for a network 

^° administrator to select an appropriate data transfer buffer 
size in the setup configuration. 

Other features of the invention may be apparent to those 
skilled in the art fi-om the detailed description of the example 
embodiments and claims when read in connection with the 
accompanying drawings. While the foregoing written and 
illustrated disclosure focuses on disclosing example 
embodiments of the invention, it should be understood that 
the same is by way of illustration and example only, is not 
to be taken by way of limitation and may be modified in 
learned practice of the invention. While the foregoing has 
described what are considered to be example embodiments 
of the invention, it is understood that various modifications 
may be made therein and that the invention may be imple- 
mented in various forms and embodiments, and that it may 
be applied in numerous applications, only some of which 
have been described herein. It is intended by the following 
claims to claim aU such modifications and variations. 
What is claimed is: 

1. A method of delivering a command from an initiator 
device and transferring data associated with the command to 
a target device, said method comprising: 
selecting a maximum payload size of data to be trans- 
ferred at one time between said initiator device and said 
target device; 

determining whether or not the size of the data associated 
with the command is greater than said selected maxi- 
mum payload size; 

if the size of the data associated with the command is not 
60 greater than said selected maximum payload size, then 
transferring a block to the target device which includes 
the command and all of the data associated with the 
command; and 

if the size of the data associated with the command is 
65 greater than said selected maximum payload size, then 
transferring a block to the target device which includes 
the command, an amount of said data associated with 
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said comniand equal to the selected maximum pay bad 
size aad an indicatioo that not all of the data associated 
with said command was included in the transferred 
blodc. 

2. The method recited in claim 1, wherein said indication 
is a flag included as part of the command. 

3. The method recited in claim 2, wherein the target 
device, in response to the flag, issues a request for the part 
of the data associated with the command and not included in 
said transferred block. 

4. The method recited in claim 3, wherein the transferred 
block is transferred over a first channel and the part of the 
data associated with the command and not included in said 
transferred block is transferred in another block over a 
second channel. 

5. The method recited in claim 3, wherein the transferred 
block is transferred over one channel and the part of the data 
associated with the command and not included in said 
transferred block is transferred in other blocks over a plu- 
rality of other channels. 

6. The method recited in claim 5, wherein the transfer of 
said other blocks over said plurality of other channels begins 
without waiting for the transfer of said transferred block 
over said one channel. 

7. The method recited in claim 1, wherein said indication 
is a pointer providing the memory address(es) for the part of 
the data associated with the command and not included in 
said transferred block. 

8. The method recited in claim 7, wherein the target 
device, in response to the pointer, fetches the part of the data 
associated with the command and not included in said 
transferred block. 

9. The method recited in claim 8, wherein the target 
device fetches the part of the data associated with the 
command and not included in said transferred block in 
multiple pieces. 

10. The method recited in claim 8, wherein the transferred 
block is transferred over a first channel and the part of the 
data associated with the command and not included in said 
transfened block is fetched in another block over a second 
channel. 

U. The method recited in claim 8, wherein the transferred 
block is transferred over one channel and the part of the data 
associated with the command and not included in said 
transferred block is fetched in other blocks over a plurality 
of other channels. 

12. The method recited in claim 11, wherein the fetching 
of said other blocks over said plurality of other channels 
begins without waiting for the transfer of said transferred 
block over said one channel. 

13. A host device configured to initiate the transfer of data 
to a target device, said host device comprising: 

a processor; 

a memory accessible by said processor; and 
an input/output control circuit connected to said processor 
and said memory, said input/output control circuit 
adapted to control the transfer of data to said target 
device by: selecting a maximum payload size of data to 
be transferred between said host device and said target 
device in response to a command from said processor; 
determining whether or not the size of the data asso- 
ciated with the command is greater than said selected 
maximum payload size; if the size of the data associ- 
ated with the command is not greater than said selected 
maximum payload size, then transferring a block to 
said target device which includes the command and all 
of the data associated with the command; and if the size 
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of the data associated with the command is greater than 
said selected maximum payload size, then transferring 
a block to said Urgei device which inchides the 
command, an amount of said data associated with said 
5 command equal to the selected maximum payload and 
an indication that not all of the data associated with said 
command was included in the transferred block. 
14. The host device recited in claim 13, wherein said 
indication is a flag included as part of the command. 
10 15. The host device recited in claim 14, wherein said 
input/output control circuit transfers, in response to a request 
received from said at least one target device, at least one 
additional block containing the part of the data associated 
with the command and not included in said transferred 
15 block. 

16. The host device recited in claim 15, wherein the 
transferred block is transferred over a first channel and the 
part of the data associated with the command and not 
included in said transferred block is transferred in another 

20 block over a second channel. 

17. The host device recited in claim 15, wherein the 
transferred block is transferred over one channel and the part 
of the data associated with the command and not included in 
said transferred block is transferred in other blocks over a 

25 plurality of other channels. 

18. The host device recited in claim 17, wherein the 
transfer of said other blocks over said plurality of other 
channels begins without waiting for the transfer of said 
transferred block over said one channel. 

30 19. The host device recited in claim 13, wherein the 
input/output control circuit controls the transfer of data to at 
least two target devices and selects a different maximum 
payload size for each one of the respective target devices. 

20. The host device recited in claim 13, wherein the 
35 input/output control circuit selects the maximum payload 

size for said at least one target device based on the distri- 
bution of block sizes of data transfers to said at least one 
target device. 

21. The host device recited in claim 13, wherein said 
40 indication is a pointer providing the memory address(es) for 

the part of the data associated with the command and not 
included in said transferred block. 

22. The host device recited in claim 21, wherein the target 
device, in response to the pointer, fetches the part of the data 

45 associated with the command and not included in said 
transferred block. 

23. The host device recited in claim 22, wherein the 
input/output control circuit has a buffer, the part of the data 
associated with the command and not included in said 

50 transferred block is saved in said buffer and the target device 
fetches the data from said buffer. 

24. The host device recited in claim 22, wherein the 
transfeaed block is transferred over a first channel and the 
part of the data associated with the command and not 

55 included in said transferred block is fetched in another block 
over a second channel. 

25. The host device recited in claim 22, wherein the 
transferred block is transferred over one channel and the pari 
of the data associated with the command and not included in 

60 said transferred block is fetched in other blocks over a 
plurality of other channels. 

26. The host device recited in claim 25, wherein the 
fetching of said other blocks over said plurality of other 
channels begins without waiting for the transfer of said 

65 transferred block over said one channel. 

27. The host device recited in claim 26, wherein the host 
device is a server and the input/output control circuit con- 
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trols the transfer of data to and from data storage devices the oommand, an amount of said data associated with 

connected to said server or to data storage devices connected said command equal to the selected maximum payload 

to a network including said server. size and an indication that not all of the data associated 

28. A tangible medium storing a plurality of program with said command was included in the transferred 

instructions, said program instruaions causing a host device 5 block. 

U) cany out a method of delivering a command from the host 29. The tangible medium recited in claim 28, wherein the 

device and transferring data associated with the command to program instructions cause the host device to control the 

a target device, said method comprising: transfer of data to at least two target devices and to select a 

selecting a maximum payload size of blocks to be trans- different maximum payload size for each one of the respec- 
fcrred between said initiator device and said target 1° jjyg target devices. 

device; 30. Jhe host device recited in claim 28, wherein the 

determining whether or not the size of the data associated program instructions cause the host device to select the 

with the command is greater than said selected maxi- maximum payload size for said at least one urget device 

mum payload size; based on the distribution of block sizes of data transfers to 

if the size of the data associated with the command is not said at least one target device, 

greater than said selected maximum payload size, then 31. The host device recited in claim 30, wherein the 

transferring a block to or from the target device which program instructions cause the host device to select the 

includes the command and all of the data associated maximum payload size for said at least one target device 

with the command; and which is slightly greater than the block size which is a peak 

if the size of the data associated with the command is on the distribution of block sizes of data transfers, 
greater than said selected maximum payload size, then 

transferring a block to the target device which includes ♦ ♦ « ♦ ♦ 
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